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The Biotechnology Systems Branch of the Scientific and Technical 
Information Center (STIC) no errors detected. 
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ENTERED 
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[ Edited by: 



Realigned nucleic acid/amino acid numbers/text in cases where the sequence 
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Deleted: invalid beginning/end-of-file text ; page numbers 



Inserted mandatory headings/numeric identifiers, specifically: 
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RAW SEQUENCE LISTING v DATE: 11/03/2005 

PATENT APPLICATION: US/10/554 , 387 TIME: 12:33:35 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\11032005\J554387.raw 

3 <110> APPLICANT: Shaaltiel, Yoseph 

4 Baum, Gideon 

5 Sharon Hashmueli 

6 Ayala Lewkowicz 

7 Bartfeld, Daniel 

9 <12 0> TITLE OF INVENTION: PRODUCTION OF HIGH MANNOSE PROTEINS IN PLANT CULTURE 
11 <130> FILE REFERENCE: 30570 
C--> 13 <140> CURRENT APPLICATION NUMBER: US/10/554,387 
C--> 13 < 14 1> .CURRENT FILING DATE: 2005-10-25 

13 <160> NUMBER OF SEQ ID NOS : 14 

15 <170> SOFTWARE: Patent In version 3.3 

17 <210> SEQ ID NO: 1 

18 <211> LENGTH: 22 

19 <212> TYPE: PRT 

20 <213> ORGANISM: Artificial sequence 

22 <220> FEATURE: 

23 <223> OTHER INFORMATION: ER signal peptide 
25 <400> SEQUENCE: 1 

27 Met Lys Thr Asn Leu Phe Leu Phe Leu lie Phe Ser Leu Leu Leu Ser 

28 1 5 10 15 

31 Leu Ser Ser Ala Glu Phe 

32 20 

35 <210> SEQ ID NO: 2 

36 <211> LENGTH: 7 
3 7 <212> TYPE: PRT 

38 <213> ORGANISM: Artificial sequence 

40 <220> FEATURE: 

41 <223> OTHER INFORMATION: Vacuolar targeting signal from Tobacco chitinase A 
43 <400> SEQUENCE: 2 

45 Asp Leu Leu Val Asp Thr Met 

46 1 5 

49 <210> SEQ ID NO: 3 

50 <211> LENGTH: 21 

51 <212> TYPE: DNA 

52 <213> ORGANISM: Artificial sequence 

54 <22 0> FEATURE: 

55 <223> OTHER INFORMATION: Single strand DNA oligonucleotide 

57 <400> SEQUENCE: 3 

58 cagaattcgc ccgcccctgc a 21 

61 <210> SEQ ID NO: 4 

62 <211> LENGTH: 22 
6 3 <212> TYPE: DNA 

64 <213> ORGANISM: Artificial sequence 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/554,387 



DATE: 11/03/2005 
TIME: 12:33:35 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\11032005\J554387.raw 

66 <220> FEATURE: 

67 <223> OTHER INFORMATION: Single strand DNA oligonucleotide 

69 <400> SEQUENCE: 4 

70 ctcagatctt ggcgatgcca ca 22 

73 <210> SEQ ID NO: 5 

74 <211> LENGTH: 19 

75 <212> TYPE: DNA 

79 <213> ORGANISM: Artificial sequence 

81 <220> FEATURE: 

82 <223> OTHER INFORMATION: Single strand DNA oligonucleotide 

84 <400> SEQUENCE: 5 

85 ctcagaagac cagagggct 19 

88 <210> SEQ ID NO: 6 

89 <211> LENGTH: 17 

90 <212> TYPE: DNA 

91 <213> ORGANISM: Artificial sequence 

93 <220> FEATURE: 

94 <223> OTHER INFORMATION: Single strand DNA oligonucleotide 

96 <400> SEQUENCE: 6 

97 caaagcggcc atcgtgc 17 

100 <210> SEQ ID NO: 7 

101 <211> LENGTH: 1491 

102 <212> TYPE: DNA 

103 <213> ORGANISM: Homo sapiens 

105 <400> SEQUENCE: 7 

106 gcccgcccct gcatccctaa aagcttcggc tacagctcgg tggtgtgtgt ctgcaatgcc 60 
108 acatactgtg actcctttga ccccccgacc tttcctgccc ttggtacctt cagccgctat 120 
110 gagagtacac gcagtgggcg acggatggag ctgagtatgg ggcccatcca ggctaatcac 180 
112 acgggcacag gcctgctact gaccctgcag ccagaacaga agttccagaa agtgaaggga 240 
114 tttggagggg ccatgacaga tgctgctgct ctcaacatcc ttgccctgtc accccctgcc 300 
116 caaaatttgc tacttaaatc gtacttctct gaagaaggaa tcggatataa catcatccgg 360 
118 gtacccatgg ccagctgtga cttctccatc cgcacctaca cctatgcaga cacccctgat 420 
120 gatttccagt tgcacaactt cagcctccca gaggaagata ccaagctcaa gatacccctg 480 
122 attcaccgag ccctgcagtt ggcccagcgt cccgtttcac tccttgccag cccctggaca 540 
124 tcacccactt ggctcaagac caatggagcg gtgaatggga aggggtcact caagggacag 600 
126 cccggagaca tctaccacca gacctgggcc agatactttg tgaagttcct ggatgcctat 660 
128 gctgagcaca agttacagtt ctgggcagtg acagctgaaa atgagccttc tgctgggctg 720 
130 ttgagtggat accccttcca gtgcctgggc ttcacccctg aacatcagcg agacttcatt 780 
132 gcccgtgacc taggtcctac cctcgccaac agtactcacc acaatgtccg cctactcatg 840 
134 ctggatgacc aacgcttgct gctgccccac tgggcaaagg tggtactgac agacccagaa 900 
136 gcagctaaat atgttcatgg cattgctgta cattggtacc tggactttct ggctccagcc 960 
138 aaagccaccc taggggagac acaccgcctg ttccccaaca ccatgctctt tgcctcagag 1020 
140 gcctgtgtgg gctccaagtt ctgggagcag agtgtgcggc taggctcctg ggatcgaggg 1080 
142 atgcagtaca gccacagcat catcacgaac ctcctgtacc atgtggtcgg ctggaccgac 1140 
144 tggaaccttg ccctgaaccc cgaaggagga cccaattggg tgcgtaactt tgtcgacagt 1200 
146 cccatcattg tagacatcac caaggacacg ttttacaaac agcccatgtt ctaccacctt 1260 
148 ggccacttca gcaagttcat tcctgagggc tcccagagag tggggctggt tgccagtcag 1320 
150 aagaacgacc tggacgcagt ggcactgatg catcccgatg gctctgctgt tgtggtcgtg 1380 
152 ctaaaccgct cctctaagga tgtgcctctt accatcaagg atcctgctgt gggcttcctg 1440 
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154 


gagacaatct cacctggcta ctccattcac acctacctgt 


ggcatcgcca g 




157 


<210> SEQ ID NO: 


: 8 
























158 


<211> LENGTH: 335 
























159 


<212> TYPE: 


PRT 


























160 


<213> ORGANISM: 


Homo sapiens 


















162 


<4 00> SEQUENCE: 


8 
























164 


Ala 


Arg 


Pro 


Cys 


He 


Pro 


Lys 


Ser 


Phe Gly 


Tyr 


Ser 


Ser 


Val 


Val 


Cys 


165 


1 








5 










10 










15 




168 


Val 


Cys 


Asn 


Ala 


Thr 


Tyr 


Cys 


Asp 


Ser 


Phe 


Asp 


Pro 


Pro 


Thr 


Phe 


Pro 


169 








20 










25 










30 






172 


Ala 


Leu 


Gly 


Thr 


Phe 


Ser 


Arg 


Tyr 


Glu 


Ser 


Thr 


Arg 


Ser 


Gly 


Arg 


Arg 


173 






35 










40 










45 








176 


Met 


Glu 


Leu 


Ser 


Met 


Gly 


Pro 


lie 


Gin 


Ala 


Asn 


His 


Thr 


Gly 


Thr Gly 


177 




50 










55 










60 










180 


Leu 


Leu 


Leu 


Thr 


Leu 


Gin 


Pro 


Glu 


Gin 


Lys 


Phe 


Gin 


Lys 


Val 


Lys 


Gly 


181 


65 










70 










75 










80 


184 


Phe 


Gly 


Gly 


Ala 


Met 


Thr 


Asp 


Ala 


Ala 


Ala 


Leu 


Asn 


He 


Leu 


Ala 


Leu 


185 










85 










90 










95 




188 


Ser 


Pro 


Pro 


Ala 


Gin 


Asn 


Leu 


Leu 


Leu 


Lys 


Ser 


Tyr 


Phe 


Ser 


Glu 


Glu 


189 








100 










105 










110 






192 


Gly 


Val 


Arg 


Leu 


Leu 


Met 


Leu 


Asn Asp 


Gin 


Arg 


Leu 


Leu 


Leu 


Pro 


His 


193 






115 










120 










125 








196 


Trp 


Ala 


Lys 


Val 


Val 


Leu 


Thr 


Asp 


Pro 


Glu 


Ala 


Ala 


Lys 


Tyr 


Val 


His 


197 




130 










135 










140 










200 


Gly 


He 


Ala 


Val 


His 


Trp 


Tyr 


Leu 


Asp 


Phe 


Leu 


Ala 


Pro 


Ala 


Lys 


Ala 


201 


145 










150 










155 










160 


204 


Thr 


Leu 


Gly 


Glu 


Thr 


His 


Arg 


Leu 


Phe 


Pro 


Asn 


Thr 


Met 


Leu 


Phe 


Ala 


205 










165 










170 










175 




208 


Ser 


Glu 


Ala 


Cys 


Val 


Gly 


Ser 


Lys 


Phe 


Trp 


Glu 


Gin 


Ser 


Val 


Arg 


Leu 


209 








180 










185 










190 






212 


Gly 


Ser 


Trp 


Asp 


Arg 


Gly 


Met 


Gin 


Tyr 


Ser 


His 


Ser 


He 


He 


Thr 


Asn 


213 






195 










200 










205 








216 


Leu 


Leu 


Tyr 


His 


Val 


Val 


Gly 


Trp 


Thr 


Asp 


Trp 


Asn 


Leu 


Ala 


Leu 


Asn 


217 




210 










215 










220 










220 


Pro 


Glu 


Gly 


Gly 


Pro 


Asn 


Trp 


Val 


Arg Asn 


Phe 


Val 


Asp 


Ser 


Pro 


He 


221 


225 










230 










235 










240 


224 


He 


Val 


Asp 


He 


Thr 


Lys 


Asp 


Thr 


Phe 


Tyr 


Lys 


Gin 


Pro 


Met 


Phe 


Tyr 


225 










245 










250 










255 




232 


His 


Leu 


Gly 


His 


Phe 


Ser 


Lys 


Phe 


He 


Pro 


Glu 


Gly 


Ser 


Gin 


Arg 


Val 


233 








260 










265 










270 






236 


Gly 


Leu 


Val 


Ala 


Ser 


Gin 


Lys 


Asn 


Asp 


Leu 


Asp 


Ala 


Val 


Ala 


Leu 


Met 


237 






275 










280 










285 








240 


His 


Pro 


Asp 


Gly 


Ser 


Ala 


Val 


Val 


Val 


Val 


Leu 


Asn 


Arg 


Ser 


Ser 


Lys 


241 




290 










295 










300 










244 


Asp 


Val 


Pro 


Leu 


Thr 


He 


Lys 


Asp 


Pro 


Ala 


Val 


Gly 


Phe 


Leu 


Glu 


Thr 


245 


305 










310 










315 










320 


248 


He 


Ser 


Pro 


Gly 


Tyr 


Ser 


He 


His 


Thr 


Tyr 


Leu 


Trp 


His 


Arg 


Gin 




249 










325 










330 










335 




252 


<210> SEQ ID NO: 


: 9 
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253 


<211> LENGTH: 338 






254 


<212> TYPE: DNA 






255 


<213> ORGANISM: Cauliflower mosaic virus 






257 


<400> SEQUENCE: 9 






258 


ttttcacaaa gggtaatatc gggaaacctc ctcggattcc 


attgcccagc tatctgtcac 


60 


260 


ttcatcgaaa ggacagtaga aaaggaaggt ggctcctaca 


aatgccatca ttgcgataaa 


120 


262 


ggaaaggcta tcgttcaaga tgcctctacc gacagtggtc 


ccaaagatgg acccccaccc 


180 


264 


acgaggaaca tcgtggaaaa agaagacgtt ccaaccacgt 


cttcaaagca agtggattga 


240 


266 


tgtgatatct ccactgacgt aagggatgac gcacaatccc 


actatccttc gcaagaccct 


300 


268 


tcctctatat aaggaagttc atttcatttg gagaggac 




338 


271 


<210> SEQ ID NO: 10 






272 


<211> LENGTH: 66 






273 


<212> TYPE: DNA 






274 


<213> ORGANISM: Artificial sequence 






276 


<220> FEATURE: 






277 


<223> OTHER INFORMATION: Nucleic acid sequence encoding the ER signal 


peptide 


279 


<400> SEQUENCE: 10 






280 


atgaagacta atctttttct ctttctcatc ttttcacttc 


tcctatcatt atcctcggcc 


60 


282 


gaattc 




66 


285 


<210> SEQ ID NO: 11 






286 


<211> LENGTH: 21 






287 


<212> TYPE: DNA 






288 


<213> ORGANISM: Artificial sequence 






290 


<220> FEATURE: 






291 


<22 3> OTHER INFORMATION: Nucleic acid sequence encoding the vacuolar 


targeting 


sequence 








293 


<400> SEQUENCE: 11 






294 


gatcttttag tcgatactat g 




21 


297 


<210> SEQ ID NO: 12 






298 


<211> LENGTH: 167 






299 


<212> TYPE: DNA 






300 


<213> ORGANISM: Artificial sequence 






302 


<22 0> FEATURE: 






303 


<223> OTHER INFORMATION: Nucleic acid sequence of the Agrobacterium tumefaciens 



terminator 



306 


<220> FEATURE: 








307 


<221> NAME/KEY: misc_feature 








308 


<222> LOCATION: (162).. (162) 








309 


<223> OTHER INFORMATION: n is a, 


c, g, or t 






311 


<400> SEQUENCE: 12 








312 


taatttcatg atctgttttg ttgtattccc 


ttgcaatgca 


gggcctaggg ctatgaataa 


60 


314 


agttaatgtg tgaatgtgtg aatgtgtgat 


tgtgacctga 


agggatcacg actataatcg 


120 


W--> 316 


tttataataa acaaagactt tgtcccaaaa 


accccccccc 


cngcaga 


167 


319 


<210> SEQ ID NO: 13 








320 


<211> LENGTH: 2186 








321 


<212> TYPE: DNA 








322 


<213> ORGANISM: Artificial sequence 






324 


<220> FEATURE: 








325 


<223> OTHER INFORMATION: nucleic 


acid sequence encoding high mannose 


human 


326 


glucocerebrosidase (GCD) 








329 


<220> FEATURE: 
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RAW SEQUENCE LISTING DATE: 11/03/2005 

PATENT APPLICATION: US/10/554,387 TIME: 12:33:35 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\11032005\J554387.raw 

330 <221> NAME/KEY: misc_f eature 

331 <222> LOCATION: (2181) (2181) 

332 <223> OTHER INFORMATION: n is a, c, g, or t 

334 <400> SEQUENCE: 13 

335 ttttcacaaa gggtaatatc gggaaacctc ctcggattcc attgcccagc tatctgtcac 60 
337 ttcatcgaaa ggacagtaga aaaggaaggt ggctcctaca aatgccatca ttgcgataaa 120 
339 ggaaaggcta tcgttcaaga tgcctctacc gacagtggtc ccaaagatgg acccccaccc 180 
341 acgaggaaca tcgtggaaaa agaagacgtt ccaaccacgt cttcaaagca agtggattga 240 
343 tgtgatatct ccactgacgt aagggatgac gcacaatccc actatccttc gcaagaccct 300 
345 tcctctatat aaggaagttc atttcatttg gagaggacag gcttcttgag atccttcaac 360 
347 aattaccaac aacaacaaac aacaaacaac attacaatta ctatttacaa ttacagtcga 420 
349 gggatccaag gagatataac aatgaagact aatctttttc tctttctcat cttttcactt 480 
351 ctcctatcat tatcctcggc cgaattcgcc cgcccctgca tccctaaaag cttcggctac 540 
353 agctcggtgg tgtgtgtctg caatgccaca tactgtgact cctttgaccc cccgaccttt 600 
355 cctgcccttg gtaccttcag ccgctatgag agtacacgca gtgggcgacg gatggagctg 660 
357 agtatggggc ccatccaggc taatcacacg ggcacaggcc tgctactgac cctgcagcca 720 
359 gaacagaagt tccagaaagt gaagggattt ggaggggcca tgacagatgc tgctgctctc 780 
361 aacatccttg ccctgtcacc ccctgcccaa aatttgctac ttaaatcgta cttctctgaa 840 
363 gaaggaatcg gatataacat catccgggta cccatggcca gctgtgactt ctccatccgc 900 
365 acctacacct atgcagacac ccctgatgat ttccagttgc acaacttcag cctcccagag 960 
367 gaagatacca agctcaagat acccctgatt caccgagccc tgcagttggc ccagcgtccc 1020 
369 gtttcactcc ttgccagccc ctggacatca cccacttggc tcaagaccaa tggagcggtg 1080 
3 71 aatgggaagg ggtcactcaa gggacagccc ggagacatct accaccagac ctgggccaga 1140 
373 tactttgtga agttcctgga tgcctatgct gagcacaagt tacagttctg ggcagtgaca 1200 
380 gctgaaaatg agccttctgc tgggctgttg agtggatacc ccttccagtg cctgggcttc 1260 
382 acccctgaac atcagcgaga cttcattgcc cgtgacctag gtcctaccct cgccaacagt 1320 
384 actcaccaca atgtccgcct actcatgctg gatgaccaac gcttgctgct gccccactgg 1380 
386 gcaaaggtgg tactgacaga cccagaagca gctaaatatg ttcatggcat tgctgtacat 1440 
388 tggtacctgg actttctggc tccagccaaa gccaccctag gggagacaca ccgcctgttc 1500 
3 90 cccaacacca tgctctttgc ctcagaggcc tgtgtgggct ccaagttctg ggagcagagt 1560 
3 92 gtgcggctag gctcctggga tcgagggatg cagtacagcc acagcatcat cacgaacctc 162 0 
3 94 ctgtaccatg tggtcggctg gaccgactgg aaccttgccc tgaaccccga aggaggaccc 1680 
396 aattgggtgc gtaactttgt cgacagtccc atcattgtag acatcaccaa ggacacgttt 1740 
398 tacaaacagc ccatgttcta ccaccttggc cacttcagca agttcattcc tgagggctcc 1800 
400 cagagagtgg ggctggttgc cagtcagaag aacgacctgg acgcagtggc actgatgcat 1860 
402 cccgatggct ctgctgttgt ggtcgtgcta aaccgctcct ctaaggatgt gcctcttacc 1920 
404 atcaaggatc ctgctgtggg cttcctggag acaatctcac ctggctactc cattcacacc 1980 
406 tacctgtggc atcgccaaga tcttttagtc gatactatgt aatttcatga tctgttttgt 2040 
408 tgtattccct tgcaatgcag ggcctagggc tatgaataaa gttaatgtgt gaatgtgtga 2100 
410 atgtgtgatt gtgacctgaa gggatcacga ctataatcgt ttataataaa caaagacttt 2160 

W--> 412 gtcccaaaaa cccccccccc ngcaga 2186 

415 <210> SEQ ID NO: 14 

416 <211> LENGTH: 526 

417 <212> TYPE: PRT 

418 <213> ORGANISM: Artificial sequence 

420 <22 0> FEATURE: 

421 <223> OTHER INFORMATION: High mannose human glucocerebrosidase (GCD) 
423 <400> SEQUENCE: 14 

425 Met Lys Thr Asn Leu Phe Leu Phe Leu lie Phe Ser Leu Leu Leu Ser 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/554 , 3 87 



DATE: 11/03/2005 
TIME: 12:33:36 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\11032005\J554387.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:12; N Pos . 162 
Seq#:13; N Pos. 2181 
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VERIFICATION SUMMARY DATE: 11/03/2005 

PATENT APPLICATION: US/10/554,3 87 TIME: 12:33:36 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\11032005\J554387.raw 

L:13 M:270 C: Current Application Number differs, Replaced Current Application No 
L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:316 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12 after pos.:120 
L:412 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 after pos.:2160 
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RAW SEQUENCE LISTING DATE: 11/01/2005 

PATENT APPLICATION: US/10/554 , 387 TIME: 11:00:57 

Input Set : A:\PTO.RJ.txt 

Output Set: N:\CRF4\11012005\J554387.raw 





3 


<110> 


APPLICANT: Shaaltiel, Yoseph 




4 




Baum, Gideon 




5 




Sharon Hashmueli 




6 




Ayala Lewkowicz 




7 




Bartfeld, Daniel 




9 


<120> 


TITLE OF INVENTION: PRODUCTION OF HIGH MANNOSE PROTEINS IN PLANT CULTURE 




11 


<130> 


FILE REFERENCE: 30570 


c--> 


13 


<140> 


CURRENT APPLICATION NUMBER: US/10/554,387 


c--> 


13 


<141> 


CURRENT FILING DATE: 2005-10-25 




13 


<160> 


NUMBER OF SEQ ID NOS : 14 




15 


<170> 


SOFTWARE: Patentln version 3.3 



ERRORED SEQUENCES ®OQ$ f^Q, fv. 



162 


<210> SEQ ID NO 


8 
















163 


<211> LENGTH: 335 
















164 


<212> TYPE: 


PRT 












P 






165 


<213> ORGANISM: 


Homo sapiens 










167 


<400> SEQUENCE: 


8 
















169 


Ala 


Arg 


Pro 


Cys 


He 


Pro 


Lys 


Ser Phe Gly Tyr Ser 


Ser 


Val 


Val 


Cys 


170 


1 








5 






10 






15 




173 


Val 


Cys 


Asn 


Ala 


Thr 


Tyr 


Cys 


Asp Ser Phe Asp Pro 


Pro 


Thr 


Phe 


Pro 


174 








20 








25 




30 






177 


Ala 


Leu 


Gly 


Thr 


Phe 


Ser 


Arg 


Tyr Glu Ser Thr Arg 


Ser Gly 


Arg Arg 


178 






35 










40 


45 








181 


Met 


Glu 


Leu 


Ser 


Met 


Gly 


Pro 


He Gin Ala Asn His 


Thr Gly 


Thr Gly 


182 




50 










55 


60 










185 


Leu 


Leu 


Leu 


Thr 


Leu 


Gin 


Pro 


Glu Gin Lys Phe Gin 


Lys 


Val 


Lys 


Gly 


186 


65 










70 




75 








80 


189 


Phe 


Gly 


Gly 


Ala 


Met 


Thr 


Asp 


Ala Ala Ala Leu Asn 


He 


Leu 


Ala 


Leu 


190 










85 






90 






95 




193 


Ser 


Pro 


Pro 


Ala 


Gin 


Asn 


Leu 


Leu Leu Lys Ser Tyr 


Phe 


Ser 


Glu 


Glu 


194 








100 








105 




110 






197 


Gly 


Val 


Arg 


Leu 


Leu 


Met 


Leu 


Asn Asp Gin Arg Leu 


Leu 


Leu 


Pro 


His 


198 






115 










120 


125 








201 


Trp 


Ala 


Lys 


Val 


Val 


Leu 


Thr 


Asp Pro Glu Ala Ala 


Lys 


Tyr 


Val 


His 


202 




130 










135 


140 










205 


Gly 


He 


Ala 


Val 


His 


Trp 


Tyr 


Leu Asp Phe Leu Ala 


Pro 


Ala 


Lys 


Ala 


206 


145 










150 




155 








160 


209 


Thr 


Leu 


Gly 


Glu 


Thr 


His 


Arg 


Leu Phe Pro Asn Thr 


Met 


Leu 


Phe 


Ala 


210 










165 






170 






175 




213 


Ser 


>Glu 


Ala 


Cys 


Val 


Gly 


Ser 


Lys Phe Trp Glu Gin 


Ser 


Val 


Arg 


Leu 
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^ x *± 








180 










185 








190 








2 17 


Gly Ser Trp Asp 


Arg Gly 


Met 


Gin 


Tyr Ser 


His 


Ser 


He 


He 


Thr 


Asn 




9ift 
z io 






195 










200 








205 










A. X 


Leu 


Leu Tyr 


His 


Val 


Val 


Glv 


Trn 
XJ -tr 


Thr Aso 


Trr> 


Asn 


Leu 


Ala 


Leu 


Asn 




999 
£ <c. z. 




210 










215 








220 












99R 


Pro Glu Gly Gly 


Pro 


Asn 


Trn 


Val 


Arcr Asn 


Phe 


Val 


Asp 


Ser 


Pro 


He 




99fi 


225 










9*^0 

Zj J \J 








235 










240 




9 9 Q 


He 


Val 


Asp 


lie 


Thr 


T,vc 


Asp 


Thr 


Phe Tyr 


Lys 


Gin 


Pro 


Met 


Phe 




rS — > 


9 1 Q 












q »- 0 7<v0 














9 ^ *7 


His 


Leu Gly His 


Phe 


Ser 


T.vC 

xjy 0 




T "I ^ Prn 

XX" IT X \J 


OX Li 


Gly Ser Gin Arg Val 


*S — " ? 


A o o 








260 










265 








270 








O A 1 


Gly Leu Val Ala 


Ser 


Gin 


Lys 


Asn 


Asp Leu 




Ala 


Val 


Ala 


Leu 


Met 




949 






275 










280 








285 












His 


Pro Asp Gly 


Ser 


Ala 


Val 


Val 


Val Val 


Leu 


Asn Arg 


Ser 


Ser 


Lys 


rS — > 


rfSfiO 




290 










295 








300 












z4 y 


Asp 


Val 


Pro 


Leu 


Thr 


He 


Lys 


Asp 


Pro Ala 


Val 


Gly Phe Leu Glu 


Thr 


K- - > 


Z DU 


305 










310 








315 










320 




OCT 


He Ser Pro Gly 


Tyr 


Ser 


He 


His 


Thr Tyr 


Leu 


Trp His Arg Gin 




IS ~ ~ > 












325 








33.0 










335 






/IOC 


<210> SEQ ID NO: 


: 14 
























A O *7 
ftZ / 


<211> LENGTH: 526 
























A O Q 


<212> TYPE: 


PRT 


























A 9 Q 
ft £t 1? 


<213> ORGANISM: 


Artificial sequence ff 
















A O 1 


<220> FEATURE: 




























<223> OTHER 


INFORMATION: 


: High mannose human 


glucocerebrosidase (G 




/I "3 >1 


<400> SEQUENCE: 


14 
























41 


Met 


Lys 


Thr 


Asn 


Leu 


Phe 


Leu 


Phe 


Leu He 


Phe 


Ser 


Leu 


Leu 


Leu 


Ser 




A "J *7 
4J / 


1 








5 








10 










15 






AAA 


Leu 


Ser 


Ser 


Ala 


Glu 


Phe Ala Arg 


Pro Cys 


He 


Pro 


Lys 


Ser 


Phe 


Gly 




A A 1 
ft ft X 








20 










25 








30 








444 


Tyr 


Ser 


Ser 


Val 


Val 


Cys 


Val 


Cys 


Asn Ala 


Thr 


Tyr 


Cys 


Asp 


Ser 


Phe 




ft ft o 






35 










40 








45 










44ft 


Asp 


Pro 


Pro 


Thr 


Phe 


Pro 


Ala 


Leu 


Gly Thr Phe 


Ser 


Arg 


Tyr 


Glu 


Ser 




44 Q 
ft ft j 




50 










55 








60 












4^9 


Thr 


Arg 


Ser 


Gly 


Arg 


Arg Met 


Glu 


Leu Ser 


Met 


Gly 


Pro 


He 


Gin 


Ala 




4 R"* 
ft O J 


65 










70 








75 










80 




456 


Asn 


His 


Thr 


Gly 


Thr 


Gly Leu Leu 


Leu Thr 


Leu 


Gin 


Pro 


Glu 


Gin 


Lys 




457 










85 








90 










95 






460 


Phe 


Gin 


Lys 


Val 


Lys 


Gly 


Phe 


Gly 


Gly Ala Met 


Thr 


Asp 


Ala 


Ala 


Ala 




461 








100 










105 








110 








469 


Leu 


Asn 


He 


Leu 


Ala 


Leu 


Ser 


Pro 


Pro Ala 


Gin 


Asn 


Leu 


Leu 


Leu 


Lys 




470 






115 










120 








125 










473 


Ser 


Tyr 


Phe 


Ser 


Glu 


Glu Gly He 


Gly Tyr Asn 


He 


He 


Arg 


Val 


Pro 




474 




130 










135 








140 












477 


Met 


Ala 


Ser 


Cys 


Asp 


Phe 


Ser 


He 


Arg Thr Tyr 


Thr 


Tyr 


Ala 


Asp 


Thr 




478 


145 










150 








155 










160 




481 


Pro 


Asp 


Asp 


Phe 


Gin 


Leu 


His 


Asn 


Phe Ser 


Leu 


Pro 


Glu 


Glu 


Asp 


Thr 




482 










165 








170 










175 






485 


Lys 


Leu 


Lys 


He 


Pro 


Leu 


He 


His 


Arg Ala Leu 


Gin 


Leu 


Ala 


Gin 


Arg 
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486 

*± o o 


180 






185 


190 






489 


pro Val Spr L°u 


Leu 


Ala Ser Pro 


Trp Thr Ser Pro Thr Trp 


Leu 


Lvs 


490 

*± -7 \J 


195 




200 


205 








493 


Thr Asn Glv Ala 


Val 


Asn Gly Lys 


Gly Ser Leu Lys Gly Gin 


Pro 


Glv 


4Q4 


9*1 0 

A X V 




215 


220 








497 


At?n Tip Tvr His 


Gin 


Thr Trp Ala Arg Tyr Phe Val Lys 


Phe 


Leu 


Asp 


*± ;? O 


99c 
D 




230 


235 






240 


R01 
DUX 


nla lyi nia uiu 


Hi s 
nx 0 


Lys Leu Gin 


Phe Trp Ala Val Thr Ala 


Glu 


Asn 


DUz 




94c 
Z -± D 




250 




£t Zj 




cnc 

DUD 


n Prn Car 7\ "1 0 

uiu riu oci rvxa 


ox y 


Leu Leu Ser Gly Tyr Pro Phe Gin 


Cys 


Leu 


Glv 

vjxy 


DUO 








265 


9 70 

Z / VJ 






ROQ 


DV10 T'Viy* Dta fZl n 
ir lie inx ri u ox u. 


Hi s 


Gin Arg Asp 


Phe He Ala Arg Asp 


T.01 1 
UC LL 


Glv 


Pro 


DXU 


97c 

Z I D 




280 


285 








DlO 


lilx IicU nla noli 


Cot* 
OCX 


Thr His His Asn Val Arg Leu Leu 


L. 


T.01 1 

i-J~ LI 


Asp 


DI-1 


0 on 
z y u 




295 


300 










7\ cn nl n 7\ yti T.011 
Ulll niy ucu 


T.01 1 

iJC LI 


Leu Pro His 


Trp Ala Lys Val Val 


T.OI 1 
XlC Li 


Thr 


Asp 


CI o 

D JLo 


one 




310 


315 






O Z V 


R9 1 
j^i 


Drrt CX~\ n A 1 a AT » 
riu ulU nla nia 


xi y 0 


Tyr Val His 


Gly He Ala Val His 


lip 


± y x 


Leu 


DZ Z 




325 




330 




ooc 






nbp rile JjcU nlcl 


Pro 


Ala Lys Ala Thr Leu Gly Glu Thr 


His 


niy 


Leu 


dz b 


OAO 






345 


350 








±rilc rlt) noil lilx 


Met 


Leu Phe Ala 


Ser Glu Ala Cys Val 


Gly 


OCX. 


Lys 


con 


occ 

j D D 




360 


365 








coo 
D J D 


flic lip olU Vjrlil 


Ser 


Val Arg Leu Gly Ser Trp Asp Arg 


Gly 


11C L. 


Gin 


CO A 


oth 




375 


380 








con 
DJ / 


fpi/y Cot* W*i c Cot* 

iyi ocx nib ocx 


He 


He Thr Asn 


Leu Leu Tyr His Val 


Val 


Glv 


xxp 


CTQ 


OQC 

J OD 




390 


395 






400 


541 


TVit A C3*n Tm Asn 

111L noL/ j. a. k/ noil 


Leu 


Ala Leu Asn 


Pro Glu Gly Gly Pro 


Asn 


TrD 


Val 


542 




405 




410 




415 




548 


Arg Asn Phe Val 


Asp 


Ser Pro He 


He Val Asp He Thr 


Lys 


Asp 


Thr 


549 


420 




425 


430 






552 


Phe Tyr Lys Gin 


Pro 


Met Phe Tyr 


His Leu Gly His Phe 


Ser 


Lys 


Phe 


553 


435 




440 


445 








556 


lie Pro Glu Gly 


Ser Gin Arg Val 


Gly Leu Val Ala Ser 


Gin 


Lys 


Asn 


557 


450 




455 


460 








560 


Asp Leu Asp Ala 


Val 


Ala Leu Met 


His Pro Asp Gly Ser 


Ala 


Val 


Val 


561 


465 




470 


475 






480 


564 


Val Val Leu Asn 


Arg Ser Ser Lys Asp Val Pro Leu Thr 


He 


Lys 


Asp 


565 




485 




490 




495 




568 


Pro Ala Val Gly 


Phe 


Leu Glu Thr 


He Ser Pro Gly Tyr 


Ser 


He 


His 




505 

Leu Trp His Arg Gin Asp Leu Leu Val 

52 0 



510 

Asp Thr Met 
525 
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L:13 M:2 
L:13 M:2 
L:235 M: 
L:238 M: 
L:242 M: 
L:246 M: 
L:250 M: 
L:254 M: 
L:327 M: 
L:423 M: 
L:580 M: 
L:582 M: 
L:585 M: 
L:585 M: 
L:588 M: 



70 C: 

71 C: 
332 E 
332 E 
332 E 
332 E 
332 E 
332 E 
341 W 
341 W 
332 E 

332 E 

333 E 
330 E 
252 E 
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Current Application Number differs, Replaced Current Application No 
Current Filing Date differs, Replaced Current Filing Date 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 8 
(32) Invalid/Missing Amino Acid Numbering, 
(32) Invalid/Missing Amino Acid Numbering, 
(32) Invalid/Missing Amino Acid Numbering, 
(32) Invalid/Missing Amino Acid Numbering, 
(32) Invalid/Missing Amino Acid Numbering, 
(46) "n" or "Xaa" used, for SEQ 
n" or "Xaa" used, for SEQ 



SEQ 
SEQ 
SEQ 
SEQ 
SEQ 

ID#:12 after pos.:120 
ID#:13 after pos.:2160 



ID:8 
ID:8 
ID: 8 
ID:8 
ID:8 



(46) 

(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 14 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 14 
Wrong sequence grouping, Amino acids not in groups! 
(2) Invalid Amino Acid Designator, NUMBER OF INVALID KEYS:1 
No. of Seq. differs, <211> LENGTH: Input : 526 Found: 527 SEQ: 14 
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